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ABSTRACT 



An improved classification system for classifying charac- 
teristics of an article into classes of characteristics based on 
features associated with the characteristics. The system 
includes a first rule-based classifier that provides rules that 
embody requirements of selected features so as to together 
define selected classes. These rules are applied to measured 
features of an article characteristic to assign one of the 
selected classes to that characteristic, should the article 
characteristic meet the feature requirements of one of the 
selected classes. The system also includes a trainable clas- 
sifier configured based on a training set of preclassified 
characteristics, with the trainable classifier connected so as 
to accept firom the first rule-based classifier an article char- 
acteristic not classified by the rule based classifier. The 
trainable classifier assigns one of the training set classes to 
the xmclassified characteristic, should the trainable classifier 
configuration associate that characteristic with one of the 
training set classes. A second rule-based classifier can be 
connected so as to accept classified characteristics from the 
trainable classifier for making refinement classifications, and 
a third rule-based classifier can be connected so as to accept 
unclassified characteristics from the trainable classifier for 
making classification based on selected classes. 

23 Claims, 6 Drawing Sheets 
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AUTOMATIC CLASSIFIER FOR REAL TIME istics of a particular material being manufactured, the pro- 

LNSPECnON AND CLASSIFICATION cess sequence being followed, and the particular equipment 

being used. Further, machine operators of a given plant 

BACKGROUND OF THE INVENTION frequently prefer definition of more than a general defect 
. . ^ . , j< class: for example, they may prefer to have a severity index 

This mvention relates to techniques for mspectmg and ^ ^ ^ operation assigned to classified defects so 

classifying images, and more particularly relates to auto- appropriate plant personnel can be alerted immediately 

mated classification techmques that enable real time classi- ^ejj ^ ^'killer'' defect appears and so that they arc not 

fication of images. bothered by "nuisance" defects that have no real adverse 

In many operations such as manufacturing processes, it is affect OD the manufactured product, 
desirable to inspect images of articles as the articles move But even within a given manufacturing plant or process 
through a sequence of operations to ascertain if the articles sequence, a wide variety of materials and types of material 
meet quality criteria or contain unwanted defects that require defects may be critically important, and a single rule-based 
special processing. For example, during a process for con- classification system is typically responsible for all such 
tinuous casting of steel, it is preferable to inspect a continu- materials and characteristic defects. This requires that the 
ous steel ribbon or cut strips of the ribbon in real time as they classification system be configured with a capability to 
pass between heating and cooling chambers, whereby ribbon generate a large number of defined defect features such that 
sections or strips found to meet quality criteria can be optimum classification across the variety of materials and 
immediately directed to a next process, e.g., rolling, without defect classes is ensured. As a result, configuring a rule- 
loss of time and without a change in steel temperature. ^ based classification system for a given material can be 
Similariy, processes for producing sheets ofpaper, plastic, or complex in that the system operator typically must select 
other such *Sveb" materials gain benefits in eflBciency firom those features that are preferred for a given material inspec- 
real time quality inspection of the materials during the tion process firom a substantial amount of defect feature data 
manufacmring process; and in general, a wide range of generated by the system. Thus, while the structure of a 
manufacturing and processing operations optimally include ^ rule-based classifier table is generally considered easy to 
provisions for real time inspection of articles during their implement and use, the development of an optimum table for 
manufacture. a given classification operation is in practice often an 

Inspection for minimum manufacturing quality criteria is unachievable goal, 

typically preferably based on classification of aspects of Indeed, experience in the field of real time material defect 

articles such as, e.g., defects of the articles, as a function of, classification has shown that very few manufacturing pro- 

e.g., defect type and severity. In such a defect classification cess users of rule-based classification systems have the 

technique, in the case of a web material, for example, defects ability to properly define an optimum, complete set of 

of the material are identified in images of the material and rule-based table rules for a given material manufacturing 

measurements, Le., features, of the defects are extracted operation, and thus, rule-based classification systems gen- 

from the images. The defects are then typically classified 35 erally do not in practice achieve desired levels of classifi- 

into categories, or classes, such as "scratch," "oil spot," cation accuracy. Further, the binary nature of rule-based 

"dirt," "roll mark," or other named defects, as well as decisions as implemented in a rule-based classifier often 

subcategories, or subclasses, such as "small scratch," or give rise to poor classification repeatability because feature 

"large scratch." measurement data is subject to quantization errors. The 

Conventional systems for real time inspection and 40 binary nature of rule -based decision making is not flexible 

classification, used, e.g., for quality control inspection, typi- because test feature values in a rule-based table are fixed, 

cally are implemented as a rule-based, look-up-table clas- and thus a rule-based test of a feature results in an a priori 

sifier configuration in which feature measurements such as classification decision even if a measured feature value and 

defect feature measurements are compared with pre- atestfeaturevaluedifferby as little as, e.g., 1 percent or less, 

specified table rules for making classification decisions. For 45 In addition, the performance of rule-based classification 

example, Fanl ct al., in U.S. Pat. No. 4,519,041, describe a systems depends strongly on which test feature values were 

rule-based defect classification system employing hicrarchi- selected during development of the rule-based tabic, 

cal rule-based classification logic in combination with Trainable pattern classifiers have been developed which 

syntactic/semantic classification logic to make classification overcome some of the problems of rule -based classifiers 

decisions. Rule-based systems Like this generally employ 50 mentioned above, in particular by providing the ability to 

classifier tables that include a set of logical or numerical achieve relatively more reliable classification without the 

tests for making the comparison between measured feature need for highly skilled personnel to develop an optimized 

values and predefined test feature values or test feature value classification table. This is enabled by the characteristic of 

thresholds. trainable pattern classifiers to have an ability to "learn" from 

Generally, to implement a rule-based classification 55 a set of training samples, which arc articles of known 

system, a system user must manually determine, through classes, how to make classification decisions based on test 

observation, which features are important for defining each features of articles under inspection, 

of the classification categories of interest. In the case of a Trainable pattern classifiers are, however, limited in their 

material manufacturing process, this can be a complex initial classification capabilities in that imtil a trainable 

procedure that requires a skilled engineer or technician who 60 classifier has been presented with a large number of training 

has familiarity both with, e.g., defects of interest to be samples that cover the full range of possible features for a set 

classified for a given material, as well as a good working of classes of interest, a trainable classifier cannot make 

knowledge of statistics and mathematics. The complexity of effective classification decisions. Thus, the optimmn training 

this procedure is compounded by the fact that frequently, a sample set for each intended class must Dot only be large, but 

given manufacturing plant has a unique language and folk- 65 also well-representative of the extent of the class; presen- 

lorc for describing and classifying defects, because the tation to a trainable classifier of even thousands of training 

nature of defects inherently depends heavily on character- samples will not be effective if the samples are similar — a 
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wide range of samples for each dass must instead be 
provided. This is in part because the \ise of relatively 
Qon-unique samples, i.e., samples having features that do not 
well -distinguish between the boundaries of different classes, 
can seriously degrade classification results. 5 

Trainable classifiers generally are also distinguished over 
riile-based classifiers in that trainable classifiers can be 
configured to regard selected possible classes, e.g., defect 
classes, as being more critical than others. For example, a 
trainable classifier can generally be configured such that in 10 
selecting between a first class assigimaent or second class 
assignment for a given defect the classifier will give priority 
to, say, the first class, based on, e.g., a loss value prespecified 
for each class and factored into the classification assignment 
result. While such selectivity is generally quite important for 15 
a manufacturing process, the task of implementing its con- 
figuration can be quite complicated, based on particular 
prespecified dependencies of the loss value, and this com- 
plexity often precludes its implementation. 

Even without the implementation of this particular train- ^ 
able classifier configuration, a great deal of time is typically 
required to identify and gather an optimum set of training 
samples characteristic of a given manufacturing process and 
materials to be inspected in general. When such a training set 
has been accumulated, the training process during which the ^ 
samples are "taught" to the system can reqxiire many hours 
or even days. This heavy training requirement and the 
complicated system configuration requirements are not gen- 
erally reasonable in the fast-paced environment of a manu- 
facturing process. ^ 

PROBLEMS TO BE SOLVED BY THE 
IlsrVENTIOM 

It is a goal of the present invention to overcome the 
drawbacks of the existing rule-based and trainable classifi- 
cation systems by exploiting the strengths of both rule-based 
and trainable classification systems while at the same time 
compensating for their respective weaknesses. It is a further 
goal of the invention to provide a classification system that ^ 
is automatic and that operates in real time, at high speed. It 
is a further goal of the invention to provide a classification 
system that enables ease of system configuration as well as 
superior classification reliabiUty. 

SUMMARY OF THE INVE^mON 

The invention provides an improved classification system 
for classifying characteristics of an article into classes of 
characteristics based on features associated with the char- 
acteristics. The system includes a first nile-based classifier so 
that provides rules that embody requirements of selected 
features so as to together define selected classes. These rules 
arc applied to measured features of an article characteristic 
to assign one of the selected classes to that characteristic, 
should the article characteristic meet the feature require- 55 
ments of one of the selected classes. The system also 
includes a trainable classifier configured based on a training 
set of preclassified characteristics, with the trainable classi- 
fier connected so as to accept from the first rule-based 
classifier an article characteristic not classified by the rule go 
based classifier. The trainable classifier assigns one of the 
training set classes to the unclassified characteristic, should 
the trainable classifier configuration associate that charac- 
teristic with one of the training set classes. 

This classification system of the invention takes advan- 65 
tages of the performance strengths of each of the rule-based 
and trainable classifiers, while at the same time minimizing 



4 

their drawbacks, to provide a classifier that is superior over 
both conventional rxde-based classifiers and trainable clas- 
sifiers. The classifier system of the invention provides reh- 
able and efifideDt classification with an elegant configuration 
that enables efficient classification processes. 

In embodiments provided by the invention, the classifier 
system further includes a second rule-based classifier con- 
nected so as to accept from the trainable classifier a classi- 
fied article characteristic for refining classification of that 
characteristic. The second rule-based classifier provides 
refinement rules that embody requirements of selected 
refinement features so as to define refinement classes. The 
refinement rules are applied to measured features of the 
classified article characteristic to assign one of the refine- 
ment classes to the classified characteristic, should the 
classified characteristic meet the refinement feature require- 
ments of one of the refinement classes. 

The classifier system further is provided by the invention 
to include a third rule-based classifier connected so as to 
accept fi-om the trainable classifier an article characteristic 
not classified by the trainable classifier or the first rule-based 
classifier. This third rule-based classifier provides rules that 
embody requirements of selected features so as to together 
define selected classes. The rules are applied to measured 
features of the unclassified article characteristic for assign- 
ing one of the selected classes to the unclassified 
characteristic, shoxild the unclassified characteristic meet the 
feature requirements of one of the selected classes. 

Here the selected classes defined by rules of the first 
rule-based classifier can include those classes expected to be 
frequently assigned lo article characteristics of interest and 
those classes for which assignment to characteristics of 
interest, based on rigid feature requirements, is expected to 
be accomplished with at least a minimum prespecified level 
of confidence. 

In a further embodiment, the classifier system of the 
invention is configured such that the trainable classifier first 
assigns one of the training set classes to an article 
characteristic, should the trainable classifier configuration 
associate that characteristic with one of the training set 
classes, and a first rule-based classifier is connected so as to 
accept from the trainable classifier a classified article char- 
acteristic for refining classification of that characteristic 
based on refinement rules that embody selected refinement 
features. Here a second rule-based classifier can be con- 
nected so as to accept from the trainable classifier an article 
characteristic not classified by the trainable classifier, for 
assigning one of the selected classes to the unclassified 
characteristic, should the article characteristic meet the 
feature requirements of one of the selected classes. 

In a further aspect of the invention, there is enabled the 
ability to choose features of the preclassified characteristic 
training set of the training based classifier based on a 
comparison of percentage of training set characteristics that 
are associated by the trainable classifier with the class 
known to be correct for each respective training set charac- 
teristic. This comparison is carried out as each possible 
feature is added to a set of possible features, with each 
feature being added in an order begjrming with a feature 
known to provide a highest degree of correctness in asso- 
ciation by the trainable classifier with the characteristic 
training set, based on that feature. Any possible feature for 
which comparison of the association percentage of the 
training set using a current feature accumulation in the set of 
possible features is less than an association percentage for a 
just preceding feature accumulation in the set being is 
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indicated as not to be chosen as a feature of the preclassified inspecting a moving material in a manufacturing process and 

characteristic training set. classifying defects of the material in real time; 

In other embodiments, the invention provides the ability FIG. 3 is a block diagram of an inspection system and an 

to generate measured features of article characteristics to be accompanying classification system in accordance with the 

classified, as well as providing the abiUty to acquire char- 5 invention; 

acieristic information from an article to be examined, for FIGS. 4A-C arc schematic representations of a defect 

generating the measured features. image, an overlay of a pixel array on the defect image, and 

In other embodiments, the rules provided by the first a pixel-based representation of the defect image bounded by 

rule-based classifier are configured as a classification table, an object bouinding box, respectively; and 

with each classification table row corresponding to a ^° piGS. 5A-C are block diagrams of three classifier con- 

sclected class and each column of a given class row corre- figurations provided in accordance with the invention, 
spending to a feature requirement for that class. A plurality 

of rules provided by the rule-based classifier that is con- DETAILED DESCRIPTION OF THE 

nected so as to accept finom the trainable classifier an INVENTION 

unclassified article characteristic relate in one embodiment problem of automatic inspection and classification of 

to a prespecified mmmium degree of confidence of classi- characteristics in an image of an article, such as classifica- 

fication assignment by the trainable classifier. One degree of of material defects in an image of the material, is known 

confidence js provided for each of a corresponding plurality ^e quite difficult. Consider, for example, the classification 

of selected classes, and a correspondmg mmimum degree of defects of a steel strip surface. HGS. lA and IB present 

confidence is required for assignment of a given one of the ^ ^wo gray-scale images of steel surface defects. Each of these 

plurahty of selected classes to the unclassified article char- defect images is that of a hole and both would preferably be 

actenstic. classified as a hole during a classification process. But it is 

In further embodiments, the trainable classifier is config- readily apparent that the two hole defects look very different, 

ured based on probability functions that are constructed ^ Specifically, there is no image transformation, e.g., scaling, 

based on probability of association by the trainable classifier rotation, or gray -level shifting, that can transform the hole 

of each characteristic of the prcclassificd characteristic train- image in FIG. lA into the hole image in FIG. IB. In other 

ing set with a corresponding known respective class for that words, it is not possible to create a template for the class of 

characteristic. Here, at least one feature of characteristics in defects called "hole" that could encompass both the defects 

the training set can be weighted for each class in the training in FIGS. lA-lB. 

set as a function of degree of criticaUty of that feature in ^ In the invention, classification of characteristics of articles 

correct assoaation of a characterisUc with that class. ^ ^^^^ ^ascd on characteristics' features, e.g., in the steel 

In other aspects of the invention the refinement charac- defect example above, is made based on defect features, 

teristic classes can include characteristic subclasses of at rather than raw digitized defect images, to enable automatic 

least one training set class of the trainable classifier; can classification of a broad range of defect types and severities, 

include classes defined by confirmational feature require- Features are here meant as aspects of an article 

ments for confirming validity of trainable classifier class characteristic, e.g., aspects of defects in an image of a 

associations; and can include classes defined by clarification material, that are computed based on measurements 

feature requirements for clarifying a trainable classifier class extracted from the image. Each component of the apparatus 

association between two classes for which confusion ^ and method provided by the invention for analyzing features 

between the two classes is possible. to automatically classify characteristics in real time will be 

In other aspects, the rules provided by the rule -based described in detail below, 

classifier that is connected so as to accept an unclassified Referring to FIG. 2, in accordance with the present 

article from the trainable classifier can be configured as a invention there is shown an example manufacturing process 

classification table, with each classification table row cor- 45 in which the classification technique of the invention can be 

responding to a selected class and each column of a given employed. In the example process, a material, e.g., a steel 

class row corresponding to a feature requirement for that strip 8, being rolled along a manufacturing sequence from 

class, with the classes selected for enabling classification of left to right, is automatically inspected for defects, which are 

article characteristics undefined by another classifier in the automatically classified by the classifier system 10 of the 

classification system. 5q invention configured in an inspection system 12. At least one 

Hic classifier system of the invention can be employed in light source 14 is mounted to aim at the rolling strip 8 for 

a wide range of classification applications for which char- directing a beam of light aCToss the width, W, of the strq) 8, 

acteristics of an article or articles arc to be classified. thereby creating a light stripe on the steel for inspection. One 

Characteristics of continuous articles, e.g., web materials or more cameras 16 are positioned to capture light reflected 

hke steel strips, of discrete articles, e.g., discrete products, of 55 fi'om the surface of the web along the light stripe to produce 

portions of articles, of entire articles, or of data represcnta- a linescan image of the web surface in the light stripe. The 

tivc of an article, can be classified by the classifier system of camera captures successive linescan images as the web 

the invention. moves beneath the camera. 

Further applications, features, and advantages of the As the camera captures linescans, images of the steel strip 

invention wiU be apparent from the claims, and fi'om the 60 are transferred fi'om the camera to an image processor 18, 

following description and accompanying figures. which also sends control signals back to the camera. The 

BRFFF nptirRTPTlON OF THF HD AWiKC^ ^^^^ processor extracts information from the image about 

BRIEF DESCRIPTION OF THE DRAWINGS ^^jects in the image that could represent defects of the steel 

FIGS. lA-B are grayscale images of two different defects strip, specifically extracting numeric features of the objects 

together characteristic of a "hole" defect class; 65 for use in classifying the objects as defects. "Object" is here 

FIG. 2 is a schematic diagram of an inspection system in intended to refer to distinguishable characteristics, in this 

which the classifier of the invention is configured for example case defects, found in the image and for which 
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classification is desired. The potential defects identified in of a process, acquired from a video sequence or other image 

the image arc then classified by the classifier 10 of the acquisition technique, can be classified using the classifier of 

invention based on these features. the invention. 

As explained in detail below, the classifier 10 of the In addition, non-optical infonnation, such as informatioD 

invention provides superior classification performance with ^ relating to aspects of a manufacturing process during which 

an automated classification process that can operate at real articles are being inspected and classified, can be used as 

time speeds commensurate with, e.g., the speed with which input feature infonnation for the classification process. Such 

the steel strip is being moved. Specific details of the clas- infonnation can be generated by, e.g., thermal sensors, 

sifier operation will be provided later in the discussion. moisture sensors, electromagnetic sensors, gauge, width, 

Defect classification infonnation is output fiom the clas- ^° and density measurement devices, or other such sensing 
sifier and can be transferred from the classifier to any devices. Other process-related data produced by a process- 
number of user-defined peripheral equipment for analysis of control computer could also be employed as feature infor- 
the classification information. For example, the classifica- mation. As can be recognized, beyond information relating 
don information can be transferred to a computer, to a manufacturing processing, a wide range of non-optical 
workstation 20, or to other suitable computer processor that feature infonnation can be employed as feature information 
enables operator configuration and control 22 of the classi- in Ihe classification process, with suitable feature extraction 
fication process. In addition, a manufacturing controller modes employed to produce such information 
connected with the manufacturing process, or other Refcning to FIG. 3, a selected mode of feature extraction, 
controller, can be interfaced with the inspection system to e.g., image capture, as well as a selected inspection and 
synchronize inspection and classification process timing ^ classification application, determines the specific acquisition 
with that of the manufacturing process flow. The workstation and processing functions to be performed prior to classifi- 
20 can be provided with software for viewing the classifi- cation of charac teristics of interest f or that applic ation, 
cation results on the workstation display as the inspection "(Jonsideririg an image capture system for opticaTTeature 
takes place; e.g., information about the number, type, and extraction, and as shown in the figure, an image of all or a 
location of defects can be displayed. A printer 24 connected ^ portion of an article 36, captiued by a camera 16 suitable for 
to the workstation 20 can also be used to print out defect the characteristics of the article of interest, is input to the 
data. A workstation operator can then use this information image processor 18 (see also FIG. 2) for extracting infor- 
to, e.g., modify the production process or remove material mation about the article image to be used in a classification 
that docs not meet quality requirements. process in accordance with the invention. 

An additional one or more computer processors such as As can be recognized, conventional camera systems, e.g., 

workstations 26 can further be provided for offline review video cameras such as charge -coupled device (CCD) 

and/or printing, via a printer 28, of defect data. The data can cameras, or other suitable image acquisition systems, e.g., 

also be stored, e.g., on a disk storage unit 30, sent to a remote laser scanners, can be employed in acquirin g an article 

location by way of a modem 32, transfened through a image. Preferably, the camera system selected can acquire 

manufacturing plant network 34 such as a local area network "^SSrSage at a rate commensurate with the speed of, e.g., a 

to other manufacturing operators, or otherwise delivered manufacturing line tmder inspection or in general, the 

and/or manipulated for analysis and control of the manu- desired real time inspection and classification speed, 

facturing process. jl^^ image processor 18 accepts the image signal pro- 

As can be recognized, the automated steel inspection duccd by the camera and produces a conesponding digital 

system described above is but one example of a wide range image pixel array 38, which can consist of, e.g., one or more 

of machine vision systems, and more broadly, classification rows of pixels, as is conventional, each of a digital image 

systems, in which the classifier of the invention can be intensity value and that together are representative of, e.g., 

configured for enabling real time, automatic classification. a grayscale or color version of the acquired image. Then, as 

In general, the classifier of the invention requires only that 45 is conventional, the image processor analyzes the pixel 

a machine vision system or other data acquisition system in value data to identify 40 objects in the acquired image and 

which the classifier is to be configured be capable of to generate 50 feature information for the identified objects, 

producing numeric feature information describing charac- "Object" is here intended to refer to distingxushablc char- 

teristics of articles for which classification is desired. acteristics found in the image and for which classification is 

Aside firom aspects of continuous articles sudi as web 50 desired. Identification of objects in the image can be accom- 

materials like the steel strips described above, aspects of phshed by any of a wide range of well-known and conven- 

discretc articles, such as products, e.g., computer disks, tional image processing techniques, including, e.g., pixel 

plastic components, or indeed any other discrete articles for value normalization, pixel value thresholding such as level 

which feature information can be acquired and analyzed, can or edge thresholding, pixel value filtering and/or merging, or 

be classified by the classifier of the invention. For example, 55 other image processing techniques, 

in the case of a machine vision system, images of entire Preferably, the result of the image processing, whatever 

discrete articles can be captured using, e.g., an area camera its implementation, is generation of features, for objects 

in a frame grabber configuration, as is conventional, or for identified in the image, that are sufficientiy descriptive of 

larger articles, an image of an article portion or portions of classes of interest of the objects, e.g. classes of defects, such 

interest can be captured. ^ that differentiati on between classes can be acc omolished-bv 

It is thus seen that the classifier of the invention is not t he" ci assifierT^ut because, as explained earlier, various 

hmited to a particular type or size of article for which cTasSBcaHon applications rely on various features, it is often 

classification is desired, and indeed can be implemented for difficult to determine a priori for a given classification 

classification not only of articles but also of image data such operation which features of a set of features will be impor- 

as text and graphics data acquired from a document of 65 tant for that application. Hius, in accordance with the 

interest and stored in computer memory. Likewise, charac- invention, it is preferred that if a single implementation of 

tcristics of images of articles, data, or patterns representative the classifier of the invention is to be used for various 
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classification appHcations, then the image processor with 
which the classifier is configured shotild be capable of 
generating a wide variety of features, whereby reliable 
classification in all of the various applications is ensured. 

Feature information can be generated by the image pro- 
cessor using software, special purpose hardware, or other 
processing configuration, based on speed reqtiirements for a 
selected system implementation. Whatever the 
X implementation, the feature generation process is accom- 
^ plished to derive positional, shape, size, and optical ^^i- 
( acteristic measurements for objects identified by the image 
^processor. In some applications, it can be preferable to 
^^acilitate such measurements by producing coordinates for 
defining a bounding box that is the smallest rectangle 
possible which can completely enclose all pixels associated 
with a single identified object Based on this operation, and 
referring to FIG. 4A, an object image 44, represented by a 
pixel array 38 (in an idealized manner in FIG. 4B), is defined 
by the Image processor as an object image 48 having a 
boimding box 55 outlying its periphery. 

With bounding boxes defined for identified objects, 
positional, shape, and size feature measurements can then be 
readily derived by examination of the extent of the pixels 
representing the bounded object Beyond positional features, 25 
other features relating to optical characteristics of the iden- 
tified objects can additionally be generated based on the 
grayscale values of pixels associated with the objects. Fea- 
tures relating to "moments" of an object, by regarding pixel 
gray levels as the object "mass," can also be generated. As 
can be recognized, other types of features can additionally 
be generated, for example, frequency of object occurrence, 
conditions of an imaged article in the vicinity of an identified 
object, or other such features. 

Feature measurements can be, e.g., boolean, having only 
the logical values of "true" or "false" assigned to them, or 
can be, e.g., numeric. Numeric feature values are generally 
stored as floating point numbers, as they may require a 
decimal point, and boolean features can be represented 
internally as a "1," or "0," for example. Table I below lists 
and describes example features to be generated by the image 
processor for use with the classifier of the invention in a 
material defect classification application. 



10 



TABLE I-continued 



Feature 



Feature Definition 



^ from Right 
Edge 

Box Beight 
Box W^dth 
Box Area 
Box Peximeter 
10 TbtalArca 



Perimeter 

Height/Width 

Ratio 

Area Ratio 



30 



35 



40 



TABLE I 



45 



Feature 



Feature Deflnitioa 



Box Left 



Hie cross 'Web (X-orientation) distance from a fixed 
nieasuriDg point to the left side of the bounding box. 
The cross -web distance from a fixed measuring point to 
the right side of the bounding box. 
Ttie dowD-web distance from the start of inspection to 
the top of the bounding box. 

The down- web distance from the start of inspection to 
the bottom of the bounding box. 
The cross -web distance from a fixed measuring point to 
the center of the bounding box. 
llie down-web distance from the start of inspection to 
the center of the bounding box. 
The cross-web position from the left edge of the 
inspected material to the left side of the bounding box. 
The cioss-web position from the right edge of the 
inspected material to the left side of the bounding box. 
Box Right &om The cross-wcb position from the lefl edge of the 

Edge inspected material to the right side of the bounding box. 
Box Right from The cross-web position from the rigM edge of the 
Right Edge inspected material to the right side of the bounding box. 
Box X Center The cross -web position from the left edge of the 
from Left Edge inspected material to the center of the bounding box. 
Box X Center The cioss-web position from the right edge of the 



Box Right 

Box Top 

Box Bottom 

Box X Center 

Box Y Center 

Box Left from 
Ua Edge 
Box Left from 
Right Edge 



55 



65 



inspected material to the center of the bounding box. 

The height cf the bounding box. 

The width of the bounding box. 

Box Width " Box Height. 

(2 ' Box Height) + (2 • Box Width). 

The area of the thresholded pixels in the object That is, 

the number of thresholded pixels in the object, scaled by 

the pixel resohttion. 

The perimeter cf the object. 

Box Height / Box Width. 



Tbtal Area / Box Area 
Perimeter Ratio Perimeter / Box Perimeter. 

Low Grayscale The grayscale value of the darkest thresholded pixel in 
the object. 

ffigh Grayscale The grayscale value of the lightest thresholded pixel in 
the object 

The average grayscale intensity of the thresholded pixels 
of the object 

High Grayscale - Low Grayscale 



Average 
Grayscale 
Grayscale 
Range 
R^eating 
Defect 



Boolean feature indicating a series of similar defects 
recurring at a regular interval. 



The above descriptions of features to be generated by the 
image processor are exa mples of features for use in a defect 
classification apphcation and are' meant as a representative, 
btll hoi compieteTlist^f possible features for use in classi- 
fication processes in general in accordance with the inven- 
tion. As can be recognized, there is no hmit to the number 
and type of features that may be useful in the wide range of 
classification appUcations to which the invention is directed. 
The features described above provide an example of the 
range of types of features that are important for enabhng 
efficient and reliable classification in one classification 
example. Indeed, as explained previously, a wide range of 
non-ophcal features can be employed, and no specific mode 
of feature extraction is required. 

Refcning again to FIG. 3, after features have been gen- 
erated 50 by the image processor 18, the classifier 10 of the 
invention is invoked to classify the identified objects based 
on the feature information for those objects. The classifier 10 
includes four classifier modules, namely a first rule-based 
classifier 54, hereinafter called RBCl; a training-based 
classifier 56, hereinafter called TBC; a second rule-based 
classifier 58, hereinafter called RBC2; and a third rule-based 
classifier 60, hereinafter called RBC3. 

In accordance with the invention, the configuration of 
these four classifier modules together as a classifier is 
flexible; specifically, the modules selected to be included in 
the classifier and the object features analyzed by each 
selected module can be reconfigured over time. This con- 
figuration flcxibiUty is provided to exploit the implementa- 
tion and performance advantages of each module while at 
the same time compensating for the complexity and perfor- 
mance weaknesses of the various modules. 

All three nde-based classifier modules, RBCl, RBC2, and 
RBC3, can be implemented based on any of a wide range of 
classification configurations that make classification deci- 
sions based on defined rxdes. In one example configuration 
in accordance with the invention, the rule-based modides 
embody classification tables, with each row of a given 
module's classification table being assigned to a given class, 
e.g., in the case of defect classification, a defect class called 
"scratch." The columns of a class row of the table specify for 
that class various limits on features characteristic of that 
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dass that in the dassificatioD operation are compared with only be defined and formatted into a desired dassification 

corresponding measured features of an object xmder consid- table structure to enable a classification process. However, in 

eration; for example, a feature test of "minimum bounding practice, the actual implementation of the classification table 

box height" is an example rule-based feature test to be is often time consuming and diflScult. A further weakness of 

applied to an object under consideration. An object is 5 rule-based classifiers stems from the binary nature of the 

determined to belong to a class assigned to a given row of feature test limitations used in a rule-based table. Because 

the tabic if the measured object features meet each of the ^he feature test Umitations set specific, hard, limits, they 

feature limits in the columns of the dass row. accommodate decision flexibihtr, m other words, 

^ L - . - . LI - , . 1^ i even a very slight variance of an object feature fi-om, e.g., a 

In this clarification table implementation, each mle- ^^^^ ^^^^^ measurement range, results in a negative 

based table dassifie r module daffies an object based on lO ^^ ^^^ ^^^^^ measurement range. Hie mle- 

ciRSsincMion niieft specmc to mat module bLijem,^ based classifiers are thus not responsive to even very shght 

^gh the cMca^ion table miplem^ e cjcd i n tha mo^ e^ distinctions between a test and measured feature. Distal 

kegmning with the first class row of the able the module ^^^^^^^^ errors in image data acquired during the inspec- 

compares measured feature mformationcd^^^ tion process accordingly camiot be accommodated and can 

the columns of the row with the specified feature Imiits, one 15 ^^^^^ ^ ^^oncous rule-based classification results, 

column at a time. Upon reachmg a first column feature . ^ ^- ^ ■ i_i i c . . i l j 

c«*^;fi^-.t,-„„ tu.t ti,^ «uUot ..«^r, ^™;^«,,t;«« Automahc, trainable classifiers that arc not mle -based are 

specification that the object under consideration fails to * * a li i_ • * j . 

meet, e.g.; by having less than a minimum required bound- f """"^f' ^'''^"^ "^^J* '° '^'V .ac«,mmodate 

ing box height, it islssumed that the object does not belong deviations since they do not make ngid decisions 

. I t ^ *u . . ui .u . f .u 1 t * based on a defined feature limitation or range. Because 

to the class fof that table ro\^, the rest of .the columns of that ^i,, , , -i . 

, . J J .u 1 »• rule-based automatic classifiers are similar to user- 
row are then skipped over and the classification process « j i u j i • *u * *u l *i. 

^ ^ r *u * ui rr .u « * Configured rule-based classifiers in that they both arc gen- 

contmues at the next row of the table. Upon the first „ . . • j • n • • i j • 

c . ,| - J f ^ -c . . erally characterized as inflexible m their classification dcci- 

occurrenceofpassing all required feature speaficauon tests . ^ i i_ j . , •£ 

e ' r r- .1- * . LI ' • * J. *L sion process, rule-based automatic classifiers are not 

of a given row, the class for that table row is assigned to the ^„ r \ auiuiijauiv. wa^iiivi^ ai^ uui 

u- \ J *u 1 -c »• • 1 . J f .u 1 L J o« generally preferred m the mvention but can be employed 

object and the classification IS completed for the rule -based ^ , fii t , • i -t . 

classifier. If upon reaching the end of the table, i.e., after all 1^^"^ ^ ^ \ ^f^l T"^ }^'' 

1 u u J 1 I. u • J invention of a trainable classifier that is not rule-based, a 

class rows have been addressed, no class has been assiRoed, . .-t- . - • . i i -n - i . 

u* . • -1- * -1 . u 1 J probabilistic tnuning-bascd classifier is employed to asso- 

then the object is indicated to be imclassmed. ^ . - ^ viT i i_ . . r 

■' ciate an object with a class by assignmg a probabihty of 

As can be recognized, many implementations alternative likeness between test features and measured features for 

to this table-based implementation for the rule-based das- ^very possible class to ascerUin a highest dassification 

sifiers are suitable. As the functionality and inter- probability, rather than making a binary, rule-based dassi- 

rdationship of the dassifier modules is discussed in dctaQ ^^^^^^ decision feature by feature. This flexibility comes at 

below, considerations for such alternatives wiU be provided. ^ost, however, of the need for an extended sample set for 

ne training-based classifier module of the classifier is not ^5 dassification training, 

sed on an a priori, user-configured rule implementaUon, inventors herein have recognized that the advantages 

nd instead is an automatic dassifier that makes classifica- and weaknesses of user-configured, rule-based classifiers 

f/tion decisions based on its previous training. The training- and automatic, training-based classifiers are complementary, 

pased dassifier can be implemented as any of a wide range and based on this recognition have structured the configu- 

of automatic, trainable classifiers based, e.g., on probabihs- ^ ration of the dassifier of the invention to exploit the advan- 

tic functions, interconnected neuron functions in a neural tages of the various classifier modules whQc at the same time 

network, automaticaUy-generated rules, or other suitable minimizing their drawbacks, to provide a dassifier that is 

automatic classifier oonfiguraUon. superior over both conventional trainable and user- 

In one example in accordance with the invention, the configured rule-based classifier systems. In the invention, 

training-based, classifier module is implemented as a set of 45 each class in the set of possible object classes is relegated to 

class probability functions that are based on a training set of a given one or more of the classifier modules based on 

classes for which the dassifier is said to be "trained" prior aspects of that class and the classification process of each of 

to the classification operation. Each class probability func- the modules. Over time, as the classifier system is used to 

tion defines, for a given class, the probability of an object classify objects, the dassifier modules can be reconfigured 

belonging to that class as a function of the probabiUties of 50 and one or more classes originally handled by a given first 

various object features selected to represent the class. As classifier module migrated or additionally delegated for 

explained in detail below, the training process develops the handling by a second of the modules, 

probabilities of a given feature being assodated with each In one example scenario in accordance with the invention, 

class of the training dass set. These feature probabihties are referring to FIG. 5A, the classifier 10 of the invention is 

then employed during the classification process to assodate 55 originally configured, using only RBCl 54 to classify 

an object with one of the classes by ascertaining the prob- objects/TTie RBrri classifies ohjects input tnj t, and outputs 

abihties of the object belonging to each of the training set the objects with assigned classes or with an indication ol no 

classes based on the features of the object, with the class to classification,' Preferably, the classifier of the invention is 

which the object has a highest probability of belonging then implemented with an initial, basic RBCl configuration that 

being assigned to the object by the classificafion operation. ^ enables some degree of classification without the need for 

As discussed in the backgroimd section, both user- any required operator configuration. For example, a basic 
configured ruie^6agga'*tlSgSI fi£rs~ang"'au tolmitic, trauning^^ rule-based table classifier cari be provided that'distinguishes 

^ased classmers have particular ativantages and weaknesses. betwcicn several important classes for a variety of applica- 

I Tiese a dy anta ges,and.weakness6sarctie^^ lions. 

tation and operation of the classifiers. User-co^^ureSTuIe-es In an example RBCl set-up dassification table provided 

based.classificFS=aTC-rclativeiy1incomplicated to' implement by the system for material defect dassification, defects are 

in that specific feature limitations for classes of interest need classified based on Limitations for features including total 
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area, low grayscal e, high grayscale, and he i ght /width_ ratio . 
Table II below lists this example set-up classification table 
for RBCl. 

TABLE II 





Area 


Minimum 


Maximum 


Height/Width 


Defect Class 


(sq. mm) 


Grayscale 


Grayscale 


Ratio 


Tiny & daik 


<3 




<128 




Tiny & light 


<3 








Small & dark 


A~9 




<128 




Small & light 


4-9 


^128 






long/th in/dark 






<128 


>10 


long/thut/light 




^128 




>10 


Medium & dark 


10-99 




<128 




Medittm & light 


10-^ 


^128 






Large & dark 


100-999 




<128 




Large & light 


100-999 


^128 






Buge & dark 


>1000 




<128 




Buge & light 


>1000 


S128 








maintained in the RBCl, whereby the TBC processing 
power is economized for those classes that arc more difficult 
to assi^. 

Accordingly, classes for which the binary nature of rulc- 
5 based classification is less desirable are at this point rel- 
egated to the TBC for classification by the TBC. As indi- 
cated by the figure, objects that are not classified by the 
RBCl are passed to the TBC for classification by this 
module. The TBC can also be implemented to dupUcate one 
jQ or more c^ajvses nf the RRCl f or increasi ng classificatio n 
reh ability . As mentioned above, as a practical matter, rel- 
egation of classes to the TBC is accomplished by training the 
TBC with a corresponding training set for those classes. 
The second rule-based classifier, RBC2, is here imple- 
j5 mented to accept classified objects from the TBC for refining 
of the classification. In a first example classification refine- 
ment in accordance with the invention, the RBC2 is imple- 
mented with a classification table that defines subclasses of 
the main classes defined by the TBC, e.g., *long scratdi," 
and "short scratch," both of the main class "scratch." Any 
objects whose assigned main class does not have defined 
subclasses in the RBC2 are output by the RBC2 with only 
the main class assigned by the TBC, while those objects for 
which refined subclassification is possible are output by the 
25 RBC2 with the corresponding subclassification. 

The invento rs herein h ave recognized that implementa- 
t ion of subclassilication tunctionality in the RBC2 , rather 
th an in the TBC , is partic ularly efficie pt, i n that it is more 
convenient to accummate a traimng set for a main class. 



20 



important for a given application and what features are 
important for distinguishing between the classes of interest. 
The RBCl implementation, e.g., as, a rule classification 
table, is correspondingly refined to reflect the classification 
^ data as it is collected. 

As use of the inspection and classification system of the 
invention continues, classification results can be stored for 
continual review and analysis by a system operator to 
augment the classes and features of the RBCl rule-based 



As can be recognized, other rule classification tables, and 
other types of user-configiu:ed rule-based classifiers can be 
provided with the classifier system for initial use in the 
RBCl module at the time of system initialization to elimi- 
nate the need for operator configuration. 

After system initialization using a basic RBCl classifiej 
i mplementation provade d by the^sys^tein, the system user can 
modiiy tte RBCl irnplementatioo based on collected clas- 
sification data. The collected data thereby enables the user to 

make informed decisions about what kinds of classes are ^ fathe r tKan subclass es! Accumulation of traimog sets for 

.„ subclasses can be difficult because, e.g., the distinction 

betwieen two subclasses is a subjective judgment of the 
system operator, and also because the system operator might 
misclassify objects based on fine subclass distinctions — for 
35 example, a given object might be thought of as "large" when 
reviewed among a group of relatively smaller objects, but 
thought of as "smair' when reviewed among a group of 
relatively larger objects. Conversely, in the RBC2 
imp lementation, a finely disting uishable subclas s featur e 
classifier module. As operation continues, a sample set of 4oTimit can be a priori set withotTrUias^from review of a 
object examples that have known classification assignments particiUar sai3ipirgroupyand"can be^a'ter"e'^i^y aSjuste'd^as 
is preferably assembled by the system operator for training cIS^ficarioiTresulissuggest. This enables very effective yet 
the training-based classifier TBC 56. When a reasonable uncomplicated subclass definition. 

number of such samples have been acciimiilated the classi- in a further example of classification refinement in accor- 
ficr 10 can be reconfigured, as shown in FIG. 5B, to include 45 aance with the invention, the RBC2 classifier module can be 
all possible classifier modules; namely, the RBCl 54, the configured to enable, in addition to subclassification, con- 
TBC 56, the RBC2 58, and the RBC3 60. This configuration firmation rules that can override a doubtful classification 
is like that shown in FIG, 3. made by the TBC. Here the RBC2 implementation, e.g., a 

This configuration enables classification in the following classifier look-up table (LUT) is implemented with table 
manner. Object classes, e.g., defect classes, for which a so rows based on, e.g., "if/then" statements. For example, if an 
suitable number of feature samples have been accimiulated, object classified by the TBC as belonging to the class "A" 
are used to train the TBC off-line with those samples, is known to just as likely instead belong to the class "B," 
whereby the TBC is designated to analyze a object with then the RBC2 LUT can be implemented with a row tliat 
respect to the corresponding trained sample classes, and the specifies a feature test which strongly suggests class "A." If 
RBCl is not employed for classification assignment of the 55 the feature test is "passed," then the object is assigned to 
classes included in the training set. However, the RBCl belong to class "A;" otherwise the object is assigned to 
implementation, not the TBC implementation, here prefer- belong to class "B." In addition, if the TBC is configured to 
ably includes at least those classes that can be very easily provide to the RBC2 the probability associated with the 
recognized by the binary nature of the rule-based RBCl, class for which the object was assigned, the RBC2 can be 
such as edge defect classes and classes pertaining to specific 60 configured to test the probabiUty against a preset probability 
material locations, e.g., a first few feet of material in a threshold, with class assignments having probabilities less 
continuous roll of material. In addition, classes that can be than the threshold being invalidated, i.e., the object being 
defined by very specific geometric criteria, e.g., size indicated as unclassified. The RBC2 can here also be used 
requirements, are best handled by the RBCl. This is because for risk minimization by applying class-specific probability 
RBCl can very efifidently and effectively- classify such 65 thresholds assigned on the basis of the criticality of that 
objects. Further, classes that are expected to be commonly class. With this configuration, the flexible classification 
identified in a given application are also preferably here process of the TBC is augmented by the more rigid classi- 
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fication process of the RBC2 to definitively classify an 
object that is frequently misdassified. 

The third rule-based classifier, RBC3, is here imple- 
mented to accept unclassified objects from the TBC for 
making classification decisions for those objects. In one ^ 
example RBC3 implementation, those classes for which 
TBC classification is preferred but an adequate sample set is 
not yet accumulated are moved from the RBCl to the RBC3. 
Because the TBC is more precise than a user-configured 
rule-based classifier in its classification decisions, it is 
preferable that the TBC first make classification decisions, 
with those objects not classified by the TBC then considered 
by a rule-based classifier such as the RBC3. 

In a particularly advantageous implementation, the RBC3 
can be configured to provide a converse functionality to the 
classification invalidation function of the RBC2. 
Specifically, if the TBC is configured to provide to the RBC3 
one or more of the top dass names and corresponding 
classification probabilities for an object that that was not 
classified by the TBC due to, e.g., the top class probabilities ^ 
falling below a preset probability threshold, then the RBC3 
can revisit the classification decision on a class by class 
basis. For example, a LUT configuration of the RBC3 can be 
implemented with a rule whereby for a TBC-unclassified 
object which has a specified top TBC class assignment and ^ 
a specified minimum probability associated with that 
assignment, the unclassified object is assigned the top TBC 
class assignment. As can be recognized, this type of rule 
effectively enables application of class-specific probability 
thresholds to the TBC classification process; in general, only ^ 
a single, universal probability threshold can be supplied by 
the TBC due to the nature of the TBC decision making 
process. The RBC3 thus enables consideration of the spe- 
cific nature of each class, as it relates to a corresponding 
probability threshold known to be characteristic for that 
class, in a manner that cannot generally be accomplished by 
the TBC. Objects that are classified by the RBC3 are output 
with a corresponding classification, while objects that are 
Qot classified by the RBC3 are output with a corresponding 
indication. ^ 

With this overall classifier implementation, objects input 
to the RBCl are analyzed with respect to the group of 
classes efficiently handled by a rule-based classifier, with 
those objects for which classification is here possible being 
output by the RBCl with an assigned classification and 
those objects that remain unclassified being output to the 
TBC. The TBC then analyzes the remaining objects based 
on its training set, making very flexible and precise classi- 
fication decisions. Those objects that are not classified by the 
TBC are output to the RBC3, which is implemented to 
include all classes not handled by the RBCl and the TBC. 
RBC3 analyzes the remaining unclassified objects and out- 
puts either a classification assignment or an indication of no 
classification to any of the objects it processes. 

Objects that are classified by the TBC are output to the 
RBC2, which is implemented to make refined subclass 
decisions for the classified objects. As explained earlier, 
objects for which no subclass assignment is possible are 
output by RBC2 with their main class assignment from the go 
TBC, while objects for which a subclass assignment was 
possible are output with an assignment of a refined class. 
This classifier configuration results in extremely efficient 
and fast classification that exploits the performance 
strengths of each of the classifier modules. 55 

In one example implementation of this configiuration, an 
array of feature values for an object is input to the RBCl, 
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which outputs, for classified objects, a class number that is 
an index into an array of possible class names. A prcsped- 
fied index, e.g., "0," is set to represent unclassified objects. 
The array of feature values for such unclassified objects is in 
turn passed to the TBC, which makes a classification deci- 
sion for the objects and outputs a class number for TBC- 
made dass assignments, and the index indicated for no 
classification for unclassified objects. The TBC also adds 
four values to the feature array, namely, the indexes of the 
two classes with highest probability for the object, and the 
value of each of the two probabilities. Objects that are 
unclassified by the TBC are passed to the RBC3, and objects 
that arc classified are passed to the RBC2. Hie two highest 
class indexes and probabilities arc employed by the RBC2 in 
making refined, clarifying classification decisions, and are 
employed by the RBC3 in making class-specific, threshold- 
based classification dedsioos, both as dcsaibed above. The 
TBC class number is output by the RBC2 if no classification 
refinement is accomplished by the RBC2, while an alterna- 
tive class number is output by the RBC2 if indeed classifi- 
cation refinement is achieved. The RBC3 operates like 
RBCl, outputting a class number or an index representing 
no class. 

As object examples are continually collected during sys- 
tem operation and are found to correspond to those classes 
being handled by the RBC3, the TBC can be trained with 
those class samples to relieve those classes fiom the RBC3. 
Referring to FIG. 5C, eventually all of the classes handled 
by the RBC3 can be transferred to the TBC, given that 
adequate feature samples for the classes are collected, 
whereby the classifier 10 now can be configured with the 
RBCl implemented with the classes most efficiently ana- 
lyzed by it, with the TBC implemented to analyze all classes 
not handled by the RBCl, and with the RBC2 implemented 
to make refinement classification decisions for the TBC 
classes. 

In many applications, however, it may be preferable to 
maintain the classifier configuration as shown in FIG. 5B 
with tbe RBC3 mle-based module employed with at least the 
initial RBCl basic classification implementation, e.g., as 
given in Table II above. This configuration enables at least 
a broad classification of new and unknown classes that turn 
up as the classifier is operated over time, e.g., when new 
types of defects appear in a material manufacturing process 
due to some sudden change in the process. In this manner, 
new defects that cannot be recognized by the RBCl or the 
TBC can be generally classified by the basic classification 
implementation of the RBC3, whereby manufacturing 
operators can be alerted to severe or important conditions. In 
addition, it may be preferable for many applications to retain 
the class-specific, threshold-based classification functional- 
ity of the RBC3 to augment the TBC-bascd classification 
decisions. 

With this understanding of the classifier operation, gen- 
eral considerations for the range of classifier module imple- 
mentations can be revisited. Specifically, for the RBCl, any 
rule-based classification technique can be employed that 
accommodates the possibility of unclassified objects. This 
characteristic is necessary for identifying the objects that are 
to be classified by the TBC due to the inability of the RBCl 
in classifying such objects. Beyond classifier tables, rule- 
based classifiers such as decision trees and a wide-range of 
other preconfigurable, rule-based dassifier configurations 
can here be employed. For the TBC, any automatic classi- 
fication technique can be employed, preferably that, like the 
RBCl, accommodates the possibility of undassified objects. 
If the TBC implementation does not accommodate unclas- 



07/17/2003, EAST Version: 1.03.0002 



6,0! 

17 

sified objects, then the RBC3 cannot be used. While this 
configuratioD can be nscd, it is not preferable because it 
requires that any classes for which a training set is not 
accumulated must be resident in the RBCl. In many 
applications, e.g., material inspection, it is frequently not 
possible to accumulate an adequate training set, and new 
defects can be expected due to manufacturing changes. 
Thus, it is preferred that the TBC implementation accom- 
modate the use of the RBC3 for maintaining some classes by 
the RBC3. 

Preferably, the TBC implementation acconunodates the 
identification of at least the two highest probabiUty class 
indices and their corresponding probabilities, i.e., 
confidences, for enabling refined classification by the RBC2 
and threshold-based classification by the RBC3. If only the 
single most probable class index is passed from the TBC to 
the RBC2, then only subclassification refinements can be 
accomplished by the RBC2, but the RBC2 can with this 
information make class-specific threshold-based classifica- 
tion decisions. As can be recognized, however, other infor- 
mation or a great deal more probability and/or class infor- 
mation can be produced by the TBC classification process. 

Probabilistic automatic classifier configurations for the 
automatic classifier can be implemented as a B ayes-based 
configuration, a minimum distance configuration, a normal 
distribution configuration, probabilistic neural network 
configuration, or other suitable probabilistic configuration. 
Aside from probabilistic classifier configurations, back- 
propagation-based configurations such as back-propagation 
neural networks, radial-based configurations such as nearest 
neighbor or restricted coulomb energy configurations, or 
other suitable configuration can be employed as the auto- 
matic classifier. While not preferred, the automatic classifier 
implementation can also take the form of an automatic 
rule-based system, where suitable. 

For the RBC2, any ride-bascd classification technique that 
can accommodate unclassified objects can be employed; 
here objects not classified by the RBC2 are assigned the 
class given them by the TBC. For the RBC3, any rule-based 
classification technique can be employed, generally without 
restriction. 

In a preferred embodiment, the classifier system of the 
invention is configured to enable a feature selection process 
whereby the optimum features for a given classification 
application are identified. In one such process provided by 
the invention, an accumulated set of features is first ranked, 
by classifying aU samples in a training set with the TBC for 
each feature alone in turn. The rank of a given feature is then 
defined as the percentage of samples that were correctly 
classified using only that feature. The ranked features are 
then sorted by rank in decreasing order. Table V below 
provides an example ranking for a five-feature set under 
consideration. 



TABLE III 



Rank Number 


Featur& 


Rank 


1 


Low Grayscale 


79.65 


2 


Box Wdlh 


40.62 


3 


Total Area 


39.83 


4 


Perimeter 


11.47 


5 


Avciagc Grayscale 


3.51 



Then ranked features having a ranking below a preset 
threshold are discarded. In the scenario illustrated in Table 
III, e.g., use of a threshold of, e.g., 5% of the sum of all 
ranks, results in discarding of the feature called "Average 
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Grayscale.'' Next in the process, an iterative compilation of 
a selected feature list is undertaken. The first feature in the 
list is that with the highest rank, firom the ranking list, and 
the rank of the first feature in the list is defined to be the 

5 cunent maximum ranking. With this designation, the first 
feature is indicated to be a valid feature. Then, starling with 
that highest feature and the next highest feature of the sorted 
rank table, all of the samples in the TBC training set arc 
classified using both features. If the percentage of samples 
classified using the two features is higher than the initial 
maximum percentage rank, then the second feature is indi- 
cated to be a valid feature along with the first feature, and the 
new percentage is marked as the current maximum ranking. 
This process is then repeated, adding the next feature in the 
sorted ranking table, until all features have been included in 

^5 a classification operation. 

If, in any of the iterations as features are added to the 
selected feature list and a classification operation is 
completed, the percentage of correct sample classifications 
is less than the current maximum, then the most-rcccntly 

20 added feature to the list is indicated to be invalid, and the 
current classification percentage is not stored as the current 
maximum percentage. After aU features have been included 
in the list and a corresponding classification operation, all 
features indicated to be valid are designated as those features 

25 optimally employed in the given classification appUcatioo 
corresponding to the training set employed during the itera- 
tive process. The selected feature set is then supplied for 
configuration among the various classifier modules, and the 
features indicated to be invahd are discarded. Table IV 

30 below outlines characteristics of the resulting feature list as 
this process is iterated. 



TABLE IV 



35 





Features 


Classification 


Feature 


Updated 


Iteration 


Included 


Rank 


Accepted? 


Featme list 


1 


1,2 


98.91 


yes 


la 


2 


1,2^ 


99.35 


yes 




3 


1,2^,4 


99.35 


no 


1,2,3 



40 

Because features which are directed to similar properties can 
give similar classification results, it is possible that a feature 
considered in a later iteration of the list makes a previously- 
considered feature or features useless for enhancing classi- 

45 fication success. To avoid this condition, it is preferable that 
an additional round of classification operation iterations be 
performed. Here, starting with the second to the last feature 
in the selected feature list, with the last feature being valid 
by definition, a TBC classification of the training set is 

50 performed using all but that feature. If the percentage of 
correctly classified samples is greater than or equal to the 
cunent maximum, then that feature is indicated to be not 
valid and is discarded from the set. This process is repeated, 
working through the feature list, until each feature has been 

55 considered. T^le V below outlines characteristics of the 
resulting feature list as this process is iterated. 



TABLE V 



60 













Updated 




Features 


Feature 


Qassificatioa 


Feature 


Feature 


Iteration 


Included 


Excluded 


Rank 


Exduded? 


list 


1 


1,3 


2 


99.35 


yes 


3,3 


2 


3 


1 


39.83 


no 





65 

This feature selection process identifies a set of features to 
be used by the classifier that maximizes classification a ecu- 
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racy and mmimizes the number of features used, based on a 
defined training sample set for the TBC for the given 
application. The selected features are by definition the most 
preferable features to be employed by the TBC, but 
additionally, are useful in identifying important features to 
be included in the RBCl, RBC2, and/or RBC3 classifier 
modules. Once important features are identified, specific 
feature tests can be developed for use in the rule-based 
classifier modules, and features identified as being not 
important and that were previously implemented in the 
rule-based classifier modules can be removed fiom those 
modules if desired. 

Turning now to specific implementation of the rule -based 
classifier modules, in one example, as described earlier, each 
rule-based classifier module is based on a classification table 
in which each row is assigned to a given class with columns 
of that row specifying feature tests which all mxist be 
"passed" by an object for that object to belong to that class. 
Feature values and range values can be, e.g., numeric or 
boolean. Range limits for a given feature can include a 
minimmn value, a maximum value, or both. Numeric feature 
values, e.g., "minimtim bounding box height," are stored as 
floating point numbers in a LUT formatted in memory. 
Boolean feature values, e.g., "repeating defect," are stored in 
the LUT vnth, e.g., a "1" value indicating "true," and a "0" 25 
value indicating "false." 

To maximize speed and ef5ciency of the classifier, it is 
preferred that the classification LUT be defined such that the 
smallest number of class feature tests need be performed 
before a classification is made. To this end, it is preferred 30 
that whenever possible, the features used to define the 
classes be selected as those that are generally consistent for 
all objects of a given class. Of course, not all features for a 
given class are tiseful for classifying every object. For 
example, in the case of defect classes, many defects are not 
location-dependent, and so location features e.g., are not 
particularly useful in classifying defects. Additionally, some 
features that identify a specific type of defect may be more 
imique than others. For example, there may be several types 
of defects with a common width range, so that although 
width may be a necessary feature in identifying these defect 
types to distinguish them fi-om, e.g., other similar defects 
that fall outside the width range, it is not alone sufficient to 
distinguish the defects. 

Based on this consideration the feattire tests of a given 
class row are preferably specifically ordered with the most 
unique feature first and the most common feature last. 
Because the first failure of an objea to meet a feature test in 
a given row results in abandonment of that class row in 
making a classification decision, this ordering enhances 
speed and efficiency of the classification process. Similarly, 
the rows of classes in the LUT are preferably ordered based 
on the frequency of occurrence expected for the classes in a 
given application. The class rows are here ordered from that 
class of highest expected occurrence to the least common 
class. This results in fewer number of class row examina- 
tions prior to classification. Additionally, it is preferred that 
classes known to be more critical for a given application be 
located relatively higher in the table than less-critical 
classes, whereby objects falling into an intersection between 
given critical and less critical classes will be assigned the 
more critical class. 

Considering classes and features that are preferably 
handled by the RBCl, i.e., the initial rule-based classifier 
module of the classifier system. Table VI below lists three 
example classes of different types of oil spots and corre- 
sponding features for each class for the RBCl for an 
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application in which steel strip defects are detected and 
classified. These classes are very common for the example 
application, and thus are preferably always implemented on 
the RBCl and not the TBC or the other rule-based classi- 
fiers. Because none of the three oil spot defects is critical for 
the quaUty of the product in the example, and instead are 
used to monitor the production line quality, they are in the 
example all grouped as one and the same class of "oil ^ot." 

TABLE VI 



Class Name 


Feature 


Feature test vahie 


Oil Spot 


Repeatiiig Defect 


False 




Low Grayscale 


70-110 




High Grayscale 


80-140 




Tbtal Area 


^2.5 »q. in. 




Box Height 


^1.0 in. 




HdghtAVidth Ratio 


25-30 


on Spot 


Repeating Defect 


False 




High Grayscale 


120-1 45 




Low Grayscale 


105-135 




Tbtal Area 


0.73-3.25 sq. in. 




Heighl/Width Ratio 


0.8-1.3 


Oa Spot 


Tbtal Area 


<0.002 sq. in. 




Repeating Defect 


False 




Average Grayscale 


100-160 




Low Grayscale 


>100 




High Grayscale 


<140 



Table VII below illustrates other examples of types of 
classes that are effectively configured in the RBCl, again for 
the example of classifying material defects, here steel 
defects. In the first example shown, the class "Ignore" is 
assigned to all defects with a bounding box bottom of less 
than 1 foot. Because the first foot of production material is 
in this example trim area, all defects in this region can be 
ignored. With this class definition, the RBCl screen out 
areas of material that need not be considered. 

The other example in the table is a class called "weld 
hole." In the example apphcation here, such weld holes are 
purposely made in a steel strip to identify the location at 
which a weld has been made to join two strips together. 
Because the shape, dimensions, and position of these weld 
holes are typically well-deteraiined by the produaioo line 
process, it is uncomplicated to define a rule -based test for 
detecting them. Also, if the mechanism that produces the 
holes need be changed, whereby the hole characteristic is 
correspondingly changed, it is easier to diange the rule- 
based classifier table rules rather than collecting new data to 
build a TBC training set for the adjusted holes. In the 
example, the holes are always of a circular shape, whereby 
a height-to-width ratio of close to "1" is required, the total 
area is known to always be about 1.3 square inches, and the 
horizontal center of a hole's bounding box is known to be 
located about 10 inches fi-om the left edge of the material. 



TABLE VII 



Class Name 


Feature 


Feature test value 


[gnorc 


Box Bottom 


<1 ft 


Weld hole 


Eleighl/Width Ratio 


0.950-1 .050 




Tbtal Area 


1.2-1.4 sq. in. 




Box X Center 


9.950-10,050 in. 



Considering the other rule-based classifier modules, recall 
that the RBC3 module is implemented with those classes not 
in the RBCl and that have yet to be implemented in the TBC 
or that are to be perpetually maintained in the RBC3 module 
for, e.g., classifying newly produced objects. As can be 
imderstood, a wide range of classes can thus be defined in 
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the RBC3, with no particular limit or requirement. The 
RBC2 classifier module is implemented with subclasses for 
refining the classification of the TBC, with validation tests 
for invalidating dubious TBC classification results, and with 
inter-class queries for making more precise classification 
decisions between two possible classes, as explained above. 

Table VIII below lists three example classes and their 
features the RBC2 for the example appHcation in which steel 
strip defects are detected and classified. The first two 
example classes, "lightning bolt-medium" and "lightning 
bolt-severe," are actually subclasses of the main class "light- 
ning bolt/* which can be configured in the TBC. These two 
subclasses divide those defects classified into the class 
"hghtning bolt" into the "medium" subclass for lightning 
bolt defects of between 1-2 sq, indies, and into the "severe" 
subclass for lightning bolt defects of greater than 2 sq. 
inches. Note that the highest-probability class ascertained by 
the TBC is indicated. 

In the third example of the table, refinement of a ques- 
tionable TBC classification is accomplished. Here, two 
classes, namely, "sliver** and "oil streak," are known to in 
some cases be confused for each other by the TBC because 
these two classes look very much alike. However, it is 
known that the defect "sliver** is more severe than that of 
"oil streak," and should be weighted more heavily in a 
questionable classification between the two classes. Here, 
the RBC2 module is configured to definitively classify a 
defect as "sliver" whidi was classified by the TBC as "oil 
streak" when the second highest TBC classification prob- 
ability was associated with the "sliver" classification, with 
the highest TBC classification probability being associated 
with "oil spot** but that probability being less than 1%. 

TABLE VIII 



Class Name 




Feature 


Feature test value 


1 .ightning Bolt - 


Medium 


1st TBC Class 


lightning Bolt 






Ibtal Area 


1-2 sq. in. 


Lightning Bolt - 


Severe 


Ist TBC Oass 


Lightning Bolt 






Total Area 


>2 sq. in. 


Sliver 




2nd TBC Class 


Sliver 






Ist TBC Class 


Oil Streak 






1st TBC probability 


<.01 



explicit class probability function cannot be formulated for 
such an application. In this case, in accordance with the 
invention, class probability functions are produced based on 
approximations of histogram distributions of the probabili- 

5 ties for values of the features in the class. 

As explained earlier, before a training-based classifier 
module is configured in the classifier of the invention, the 
rule-based RBCl classifier module can be employed to 
collect classification information and parse data based on the 

10 rule-based classification. As this goes on, a system operator 
can then manually classify feature samples produced by the 
RBCl to produce a training set of samples having known 
and correct classification assignments. For example, consid- 
ering the basic RBCl rule-based classification table pro- 

15 vided by the system for initial implementation of the RBCl, 
as in Table IV above, a system operator can examine the 
defects classified by this RBCl implementation as "huge 
fight" defects and collect all of those that are "huge hole" 
defects, thereby making appropriate classification determi- 

20 nations. 

It is this resulting sample classification data that is used in 
the training process to formulate probability functions for 
the classes to be handled by the training-based classifier. 
Specifically, in the training process, for each defined class in 

25 the training set and for each selected feature of the samples 
in the training set, an array each of T integers is allocated in 
memory and initialized to zero, to store data for a probabi- 
listic histogram to be built based on the training samples. 
Then data from each of the samples of the training set is 

30 examined, and for each defined feature, the minimum value, 
^miny found for that feature, and the maximum value, F„,^, 
found for that feature among all of the samples are deter- 
mined across all of the classes and stored in an array. 
Then, data from each of the training samples is again 

35 examined, and for a given sample, for each feature an index, 
k, is computed to be used as an index in determining a 
histogram element for that feature for the sample's class, as 
follows: 



(1) 



T\iraing now to an example implementation of the TBC, 
this automatic classifier is in one embodiment based on 
probability functions that indicate probabilities of an object 
having a given feature belonging to a given class. The base 
of knowledge on which the TBC makes classification deci- 
sions is here formed of a set of probabiUty functions for the 
classes of the set of examples provided in training the 
classifier. 

In the training process, a probability function is produced 
for each class based on one feature set permutation, with a 
probability function for each possible permutation produced 
for each class. The probability function Pa(v), where 
vo(v^, . . . , v^) is a set of N features, defines the probability 
of a sample having feature set v belonging to the class A 
Knowing the probabilides of all classes, then the so-called 
Bayes* decision nile can be invoked, whereby, assuming that 
the sample belongs to the class for which P(v) is largest, the 
overall error in the classification process is known to be 
minimized. 

Typically, a probability function can be explicitly formu- 
lated for deterministic processes but not for stochastic 
processes. In the case of many classification applications 
such as defect classification, defect appearance is a 
stochastic, rather than deterministic process. Thus, an 



where F is the value of a given feature being examined for 
^5 a given sample. T is the number of elements in the histogram 

array being built for the sample's class, and k=0 to T-1. 

Each feature histogram is thusly developed by mapping 

histogram data of the range of values for each feature to a 

histogram for each class that indicates the probability of a 
50 given feature for that class. This is done by defining the k^ 

indexing element of the histogram for class A for which the 

given sample belongs by: 

//^j^*)-^^X*^)+i. (2) 

55 In a next step, the class histograms are smoothed by 
applying a gaussian filter, g(x), to non-boolean feature 
histogram data, where: 

where a is the standard deviation of the gaussian profile and 
is preferably set to be large enough to fill in gaps in the class 
65 histograms caused by, e.g., unvisited indexing elements, 
without reducing the histogram to a flat profile. Specifically, 
the standard deviation is preferably set by testing the per- 
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formancc of the classifier; any of a large range of deviation in this scenario, if a feature is important for describing 

vahics centered at about TAO can typically be used with only some classes, that feature can be weighted as "irrcl- 

good results. After the smoothing process, the histograms evant" for the other classes. In practice, when a feature is 

are normalized to eliminate weighting based on unequal weighted as being unimportant for a given class, then the 

number of samples across the classes. 5 conesponding histogram is not computed by examination of 

Wth this approach the training process produces a num- training set data as in the above description, and instead, it 

her of histograms corresponding to the product of the is set to a constant vahie. In this way, the feature distribution 

number of classes and the number of features. The number ^^j. ^^^^ ^^^^ ^ ^^^^^^^ ^^^^ ^^^^ 

of index Clements T of each histogram depends on how distributions, without the necessity of conditional tests dur- 

sparse the distributions of the feature vahi^ are. and rep- ^ classification process, which would slow down the 

resents the factor tised to normauze the feature values. ^ *fi H 

Larger values for T result in higher classifier resolution, but ci^iner spec . 

require larger values for the stTdard deviation used for the . Similarly, for the case m which a given feature is very 

smoothing process, and require significanUy more memory important for describing a class, e.g. meaning that if a 

to store than smaller values. feature under test is outside the range of the djstnbution for 

In practice, the value for T is preferably determined by a 15 the given feature then the sample whose feature under test 

given classification apphcation. The training set is thus definitively docs not belong to the class, the given feature 

preferably used to build the class histograms with a large can be weighted as "essential.'* In this case, the corrcspond- 

value for T, e.g., 512 or 1024, and then the value for T is ing histogram is not smoothed using expression (3) above, 

reduced until the performance of the classifier is foimd to and the index elements of the histogram distribution that are 

drop drastically. For example, in the case of steel defect 20 not represented in the training set are not addressed, i.e., not 

classification, setting T«32, corresponding to a normaliza- assigned a value. In this way, elements not represented in the 

tion of 5 bits, causes the performance of the classifier to drop histogram by the samples remain zero for that class. It must 

by less than about 1 percent from that when a value of 512 be here noted that because it is not possible to compute the 

is used. logarithm of zero, index elements that are not addressed are 

Assuming that the features used in classification are 25 preferably set to some large negative number that is selected 

statistically independent, the probability function of each to avoid overflow in the summation expression. 

class is in the case of multiple features computed using the Commonly, the value of a feature being tested during 

gaussian-filtercd histograms with the probability function classification is foimd to be undefined due, e.g., to a division 

for the class A is given as: by zero in computation of the value, or is found to be out of 

30 the range of the training set. In such cases, training-based 

^ ^ y/v (4) classifiers that require normalization of feature values typi- 

pl Pj.(v) . cally are inoperable. In the training-based classifier module 

j'i ) of the invention, this limitation is overcome by including 

two elements in the histogram array, namely, a first element 

, , ^. , , 35 for not valid values, and a second element for out-of-range 

Id this case, the histogram distributions are one^mensional ^^^^^ ^ ^ j^^^ ^^ ^^^^ 

arrays. ^ in the one-feature case, and the amount of data ^^^^ ^^^^^^ ^1^^^^, ^ ^ 

produced duiMg the probabiuty computation grows linearly, ^^j^^, ^^^^^^^ ^^^^^^^ ^eiehud as "normal/' to a 

not exponenttally. with the nuniber of features. This learaing ^^^^ j^^^^^^ ^^^^^ ^ "irrelevant." and to 

process has the advantage of bemgqmtesmiple, and does ^ j^^,^^^^ ^^j^,^^ ^ "essential." With this 

not require a great deal of computational time. In addiUon, ement, no additional conditional tests are needed in 

unmce automatic class|flers such as neural networks, this cbssification process to avoid inoperable compuUUon 

technique is guaranteed to converge to a finite and stable scenarios. As can be recognized, similar or other feature 

"I?""'. • .u . • • L J 1 -c weighting techniques can be employed with a selected 

WiUi this traimng sequence, the trammg-based classifier t,^^ie° lassifler other than probabilisUc classifleis. 

classifies an object by transforming an array of features mto ,j ^ ^^^^ ^.^^^^^^^ ^ ^ recognized, the 

the index array given by expression (1) above, and then the ^^^^ implemenuUons of rule-based and training-based 

probabdities for "ch class ^e computed using th«= appjoxi- ^^^^^ ^^^^^^ ^ ,^ „j ^ wide-range 

mate expression (4) above. Preferably, expre^on (4) above ^j^^^ implementations that are suitable and 

B computed usmg a loganthmic e;q,ression of Uiat fimcUon ^ ^ contemplated for the classifier of the invention, 

such that only addition, not multiphcatton, of the probabdi- ^.^^^ „j ^.^^ classifier 

ties IS reqmred. as follows: implementations, the output of the classifier is a set of 

objects having assigned classifications and optionally, in 

]*'^ ^ n ] addition, a set of objects having no class assigned to them, 

lo^(v)) = lod 1 j Ff{v)\ ^)^^j\o^Pfiv))^. j^^g assigned the class "unknown." Recall that the 

' term "object/* as used throughout the description, is 

intended to refer to distinguishable characteristics of an 

Conventionally, training-based classifiers cannot dis- article to be classified. As explained earlier, the classifier of 

criminate which features are best used in describing various the invention can be configured in a system that enables 

classes; indeed in some appUcations it is important to use 60 various post-classification processing of the classification 

specific features only for some classes, while some features data. For example, a workstation or other computer proces- 

may be exclusive for other classes. In the probabilistic sor can be employed to build histogram plots of classifica- 

trainable classifier provided by the invention, the importance tion information for a given application, e.g., classified 

of features can be weighted to indicate a desired importance defects on a web material, with the histogram plots being 

for the features. In one example weighting tedinique, one of 65 displayed to indicate the distribution and severity of defects 

e.g., three weights, representing "normal," "irrelevant," and across the web material. Suitable apparatus can be connected 

"essential/' can be assigned to each feature. to the system for, e.g., correspondingly marking defective 
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material under inspection, mechanically rejecting defective 
material £rom a manufacturing sequence line, sounding an 
alarm, or other functions. 

From the foregoing, it is apparent that the automatic 
classifier piu^ant to the invention and described above not 
only provides reliable and efficient classification of article 
diaracteristics, but docs so employing reconfigurable rule- 
based and training-based classifier modules that enable 
exploiting the performance strengths of each module while 
at the same time compensating for performance weaknesses. 
It is recognized, of course, that those skilled in the art may 
make various modifications and additions to the embodi- 
ments described above without departing from the spirit and 
scope of the present contribution to the art. Accordingly, it 
is to be understood that the protection sought to be afforded 
hereby should be deemed to extend to the subject matter 
claims and all equivalents thereof fairly within the scope of 
the invention. 

We claim: 

1. A classification system for classifying the diaracteris- 
tics of an article into classes of characteristics based on 
features assodaated with the characteristics, the system 
comprising: 

a first rule-based classifier providing rules that embody 
requirements of selected features so as to together 
define selected classes, the rules to be applied to 
measured features of an article characteristic to assign 
one of the selected classes to that characteristic, should 
the article diaracteristic meet the feature requirements 
of one of the selected classes; and 

a trainable classifier that is configured based on a training 
set of preclassified characteristics and a corresponding 
training class set which can include characteristic 
classes that are not the selected classses of the first-rule 
based classifier, the trainable classifier connected so as 
to accept fi*om the first rule-based classifier an article 
characteristic not classified by the first ruled-based 
classifier, to assign one of the training set classes to thea 
characteristic, should the trainable classifier configura- 
tion associate that characteristic with one of the training 
set classes. 

2. A classification system for classifying charateristics of 
an article into classes of characteristics based on features 
associated with the characteristics, the system comprising: 

a trainable classifier configured based on a training set of 
preclassified characteristics for assigning one of the 
training set classes to an article characteristic, should 
the trainable classifier configiuation associate that char- 
acteristic with one of the training set classes; and 

a first rule-based classifier connected so as to accept finm 
the trainable classier a classified article characteristic 
for refining classification of that characteristic, the first 
rule-based classifier providing refinement rules 
embodying requirements of selected refinement fea- 
tures so as to together define refinement classes, the 
refinement rules to be applied to measured features of 
the classified article characteristic for assignment one 
of the refinement classes to the classified characteristic, 
should the classified characteristic meet the refinement 
feature requirements of one of the refinement classes. 

3. A classification system for classifying characteristics of 
an article into classes of characteristics based on features 
associated with the characteristics, the system comprising: 

a trainable classifier configuxed based on a training set of 
preclassified characteristics for assigning one of the 
training set classes to an article characteristic, should 
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the trainable classifier configuration associate that char- 
acteristic with one of the training set classes; and 
a first rule -based classifier connected so as to accept from 
the trainable classifier an article characteristic not clas- 
sified by the trainale classifier, the first rule-based 
classifier providing rules that embody requirements of 
selected features to as to together define selected 
classes, the rules to be appUed to measured features of 
the unclassified article characteristic for assigning one 
10 of the selected classes to the imclassified characteristic, 
should the unclassified article characteristic meet the 
feature requirements of one of the selected classes. 

4. A classification system for classifying characteristics of 
an article into classes of characteristics based on features 

15 associated with the characteristics, the system comprising: 
a first rule-based classifier providing rules that embody 
requirements of selected features so as to together 
define selected classes, the rules to be applied to 
meastu'ed featxires of an article characteristic to assign 

^ one of the selected classes to that characteristic, should 
the article characteristic meet the feature requirements 
of one of the selected classes; 
a trainable classifier configured based on a training set of 
preclassified characteristics, the trainable classifier 

^ connected so as to accept from the first rule-based 
classifier an article characteristic not classified by the 
rule based classifier, to assign one of the training set 
classes to that characteristic, shoiild the trainable clas- 

^ sifier configuration associate that characteristic with 
one of the training set classes; and 
a second rule -based classifier connected so as to accept 
fi'om the trainable classifier an article characteristic not 
classified by the trainable classifier and the first rule- 

2j based classifier, the second rule-based classifier pro- 
viding rules that embody requirements of selected 
features so as to together define selected classes, to be 
applied to measured features of the unclassified article 
charaaeristic for assigning to the unclassified charac- 

^ . teristic one of the selected classes, should the unclas- 
sified characteristic meet the feature requirements of 
one of the selected classes. 

5. A classification system for classifying characteristics of 
an article into classes of characteristics based on features 
associated with the characteristics, the system comprising: 

a first-rule based classifier providing rules that embody 
requirements of selected features so as to together 
define selected classes, the rules to be applied to 
measured features of an article characteristic to assign 

50 one of the selected classes to that characteristic, should 
the article characteristic meet the feature requirements 
of one of the selected classes; 
a trainable classifier configured based on a training set of 
preclassified characteristics, the trainable classifier 

55 connected so as to accept from the first rule-based 
classifier an article characteristic not classified by the 
rule based classifier, to assign one of the training set 
classes to that characteristic, should the trainable clas- 
sifier configuration associate that characteristic with 

60 one of the training set classes; and 

a second rule-based classifier connected so as to accept 
from the trainable classifier a classified article charac- 
teristic for refining classification of that characteristic, 
the second rule-based classifier providing refinement 

65 rules that embody requirements of selected refinement 
features so as to define refinement classes, the refine- 
ment rules to be applied to measured features of the 
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classified article characteristic to assign one of the 
refinement classes to the classified characteristic, 
should the classified characteristic meet the refinement 
feature requirements of one of the refinement classes. 

6. The classification system of claim 1 wherein the 5 
selected classes defined by rules of the first rule -based 
classifier comprise classes expected to be frequently 
assigned to article charactenslics of interest and classes for 
which assignment to characteristics of interest, based on 
rigid feature requirements, is expected to be accomplished lO 
with at least a minimum prespccified level of confidence. 

7. The classification system of any of claims 1, 2, or 3, 
further comprising means for choosing the features of the 
preclassified characteristic training set based on a compari- 
son of percentage of training set characteristics that arc 15 
associated by the trainable classifier with the class known to 

be correct for each respective training set characteristic, the 
comparison carried out as each possible feature is added to 
a set of possible features, each feature being added in an 
order beginning with a feature known to provide a highest 20 
degree of correctness in association by the trainable classi- 
fier with the charactcrisitic training set, based on that 
feature, any possible feature for which comparison of the 
association percentage of the training set using a current 
feature accumulation in the set of possible features is less 25 
than an association percentage for a just preceeding feature 
accumulation in the set being indicated as not to be chosen 
as a feature of the preclassified characteristic training set. 

8. The classification system of any of claims 1, 2, or 3, 
further comprising means for generating the measured fea- 30 
tures of characteristics of the article. 

9. The classification system of claim 8 further comprising 
means for acquiring characteristic information from the 
article to be examined for generating the measured features. 

10. The classification system of any of claims 1, 2, or 3, 35 
wherein the rules provided by the first rule-based classifier 
are configured as a classification table, with each classifi- 
cation table row corresponding to a selected class and each 
column of a given class row corresponding to a feature 
requirement for that class. 40 

U. The classification system of claim 5 further compris- 
ing a third rule-based classifier cormected so as to accept 
from the trainable classifier an article characteristic not 
classified by the trainable classifier and the first rule-based 
classifier, the third rule-based classifier providing rules that 45 
embody requirements of selected features, so as to together 
define selected classes, the rules to be applied to measured 
features of the unclassified article characteristic for assign- 
ing one of the selected classes to the unclassified 
characteristic, shoiild the unclassified characteristic meet the 50 
feature requirements of one of the selected classes. 

12. The classification system of any of claims 4, 11, or 3, 
wherem a plurality of rules provided by the rule -based 
classifier that is connected so as to accept from the trainable 
classifier an unclassified article characteristic relate to a 55 
prespccified minimum degree of confidence of classificaiton 
assignment by the trainable classifier, one degree of confi- 
dence of each of a corresponding plurality of selected 
classes, a corresponding minimum degree of confidence 
required for assignment of a given one of the plurality of 60 
selected classes to the unclassified article characteristic. 

13. The classification system of any of claims 1, 2, or 3, 
wherein the trainable classifier is configured based on prob- 
abihty functions that are constructed based on probability of 
association by the trainable classifier of each characteristic 65 
of the preclassified characteristic training set with a corre- 
sponding known respective class for that characteristic. 
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14. The classification system of claim 13 wherein at least 
one feature of characteristics in the training set is weighted 
for each class in the training set as a function of degree of 
criUcality of that feature in correct association of a charac- 
teristic with that class. 

15. The classification system of either of claims 2 or 5, 
wherein the refinement characteristic classes comprise char- 
acteristic subclasses of at least one training set class of the 
trainable classifier. 

16. The classification system of cither of claims 2 or 5, 
wherein the refinement characteristic classes comprise 
classes defined by confirmational feature requirements for 
confirming vahdity of trainable classifier class associations. 

17. The classification system of cither of claims 2 or 5, 
wherein the refinement characteristic classes comprise 
classes defined by clarification feature requirements for 
clarifying a trainable classifier class association between two 
classes for which confusion between the two classes is 
possible. 

18. The classification system of any of claims 4, 11 or 3, 
wherein the rules provided by the rule-based classifier that 
is connected so as to accept an unclassified article from the 
trainable classifier are configured as a classification table, 
with each classification table row corresponding to a 
selected class and each column of a given class row corre- 
sponding to a feature requirement for that class, the classes 
selected for enabling classification of article characteristics 
tmdefined by another classifier in the classification system, 

19. The classification system of claim 2 further compris- 
ing: 

a second rule-based classifier connected so as to accept 
from the trainable classifier an article diaracteristic not 
classified by the trainable classifier, the second rule- 
based classifier providing rules that embody require- 
ments of selected features to as to together define 
selected classes, the rules to be applied to measured 
features of the unclassified article characteristics for 
assigning one of the selected classes to the unclassified 
characteristic, should the unclassified article character- 
istic meet the feature requirements of one of the 
selected classes. 

20. An image classification system comprising: 
feature extraction means responsive to an image of an 

article to produce feature information about character- 
istics of the article for classification of the character- 
istics into classes; 

a first rule-based classifier providing rules that embody 
neqiiirements of selected features so as to together 
define selected classes, the rules to be applied to the 
extracted feature information of a given characteristic 
for assigning one of the selected classes to that 
characteristic, should the article characteristic meet the 
feature requirements of one of the selected classes; 

a trainable classifier configured based on a training set of 
preclassified characteristics, the trainable classifier 
connected so as to accept from the first rule-based 
classifier an article characteristic not classified by the 
rule based classifier, to assign one of the training set 
classes to that characteristic, should the trainable clas- 
sifier configuration associate that characteristic with 
one of the training set classes; 

a second rule-based classifier connected so as to accept 
from the trainable classifier a classified article charac- 
teristic for refining classification of that characteristic, 
the second rule-based classifier providing refinement 
mles that embody requirements of selected refinement 
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features so as to together define refinement classes, the 
refinement rules to be applied to extracted features of 
the classified article characteristic to assign one of the 
refinement classes to the classified characteristic, 
should the classified characteristic meet the refinement 5 
feature requirements of one of the refinement classes; 
and 

a third rule-based classifer connected so as to accept form 
the trainable classifier an article characteristic not clas- 
sified by the trainable classifier and the first nile-based 10 
classifier, the third rule-based classifier providing rules 
that embody requirements of selected features so as to 
together define selected classes, the rules to be applied 
to extracted features of the unclassified article charac- 
teristic to assign one of the selected classes to that 
characteristic, should the article characteristic meet the 
feature requirements of one of the selected classes. 

21. A method for classifying characteristics of an article 
into classes of characteristics based on features associated 
with the characteristics, the method comprising: 20 

applying rules, that embody requirements of selected 
features so as to define selected classes, to measured 
features of an article characteristic to be classified; 

assigning one of the selected classes to the article ^ 
characteristic, should the characteristic meet the feature 
requirements of the rules for one of the selected classes; 
and 

providing a characteristic that is not assigned one of the 
selected classes to the trainable classifier that is con- 30 
figured based on a hainiog set of prcclassified charac- 
teristics and a corresponding training class set which 
can include characteristic classes that are not the 
selected classes, the trainable classifier assigning one of 
the training set classes to the unclassified characteristic, 35 
should the trainable classifier configuration associate 
that characteristic with one of the training set classes. 

22. A method for classifying characteristics of an article 
into classes of characteristics based on features associated 
with the characteristics, the method comprising: 4Q 

applying rules, that embody requirements of selected 
features so as to define selected classes, to measured 
features of an article characteristic to be classified; 

assigning one of the selected classes to the article 
characteristic, should the characteristic meet the feature 
requirements of the rules for one of the selected classes; 
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providing a characteristic that is not assigned one of the 
selected classes to a trainable classifier that is config- 
ured based on a training set of prcclassified 
characteristics, the trainable classifier assigning one of 
the training set classes to the unclassified characteristic, 
should the trainable classifier configuration associate 
that characteristic with one of the training set classes; 

accepting an unclassified article characteristic from the 
trainable classifier, 

applying to measured feamres of the unclassified charac- 
teristic rules that embody requirements of selected 
features so as to together define selected classes; and 

assigning one of the selected classes to the unclassified 
characteristic, should the unclassified characteristic 
meet feature requirements of the rules for one of the 
selected classes. 

23. A method for classifying characteristics of an article 
into classes of characteristics based on features associated 
with the characteristics, the method comprising: 

applying rules, that embody requirements of selected 
features so as to define selected classes, to measured 
features of an article characteristic to be classified; 

assigning one of the selected classes to the article 
characteristic, should the characteristic meet the feature 
requirements of the rules for one of the selected classes; 

providing a characteristic that is not assigned one of the 
selected classes to a trainable classifier that is config- 
ured based on a training set of prcclassified 
characteristics, the trainable classifier assigning one of 
the training set classes to the unclassifed characteristic, 
should the trainable classifier configuration associate 
that characteristic with one of the training set classes; 

accepting a classifed article characteristic fi-om the train- 
able classifier; 

applying to measured features of the classified character- 
istic refinement rules that embody requirements of 
refinement features so as to together define refinement 
classes; and 

assigning one of the refinement classes to the classified 
characteristic, should the classified characteristic meet 
the refinement feature requirements of the refinement 
rules for one of the refinement classes. 

* * * * * 
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